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Loudspeaker Sensitive Sound Reproduction 

The present invention relates to a reproduction of sound or music files with a small size 
loudspeaker, particularly with a small size loudspeaker comprised in a mobile terminal 
10 of a wireless communication system. 

Many mobile terminals, as for instance a Personal Digital Assistant (PDA), a mobile 
phone, a portable computer or the like, allow to store polyphonic audio data of sound 
or music for many purposes, e.g. to output acoustical signals, alarm or ringer tones, 

15 background sounds for games or for personal entertainment. The sound is reproduced 
from the audio data by use of one or more types of electroacoustic transducers, whereby 
it is possible on many appliances to select a particular transducer as e.g. a loudspeaker 
or an earphone for the reproduction. In the context of this application, the term 
'loudspeaker' refers to an electroacoustic transducer radiating sound in an open volume, 

20 while the term 'earphone 1 denotes an electroacoustic transducer designed to radiate the 
sound into a small confined volume. 

Sound or music data can be stored in different file formats. Two types of files are 
currently popular. A first type contains sampled audio data in either uncompressed form 

25 like a 'wave' -file or in a compressed form like an 'mp3'-file. Although the compression 
reduces the file size by about a magnitude, the very limited storage space on most 
mobile terminals yet requires smaller sound or music files. This requirement is fulfilled 
by music score files which store the sound in form of data representing musical notes, 
instrumentation and articulation information. The most popular respective data format is 

30 the MIDI data format. 

A MIDI file contains a specification on how the sound is to be rendered. It can be 
regarded as a sheet of music in an electronically legible format. It contains information 
about the soundtrack, the devices being used and the acoustical parameters which have 
35 to be considered when reproducing the score represented by the data stored in the 
respective MIDI file. The collective term acoustic parameter denotes statements 
defining for instance the pitch, the note or rest values, respectively, the loudness level, 
the tempus, the timbre or special effects like vibrato or reverberation. 
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To turn a music score file like a MIDI file into sound, the information present in the 
file has to be interpreted and formed to data representing a sampled, digital sound. For 
this purpose, a so-called polyphonic synthesiser is used, which renders the score 
contained in the file to sampled data like e.g. those used in a mono or stereo wave-file. 
5 The polyphonic synthesiser may be implemented in software in a digital signal 
processor or in a separate dedicated hardware. 

The rendering of the score is usually based on so-called parameter files like for instance 
a wave table. A parameter file contains sound samples of an instrument like for instance 
10 that of a certain piano in form of digitally sampled data. Each sound sample contains 
one or more periods of the time representative signal of an instrument to be simulated. 
Spectral parameter files further keep the frequencies, phases and harmonics of an 
instrument ready for simulation. 

15 In a different method called FM (Frequency Modulated) synthesis different spectra are 
used for the synthesis with each spectrum representing a particular instrument to be 
simulated. The spectra are generated by a sine wave frequency modulated with further 
sine wave signals, which may again be modulated with still another sine wave signal 
and so on. The spectra are stored and thus made available in a FM-spectra file. 

20 

In mobile terminals, pieces of music are preferably kept in memory in form of MIDI 
files, as the size of MIDI files is extremely small compared to files containing sampled 
audio data. A PCM (Pulse Code Modulation) format audio file, like for example a 
".wav M -file uses up to 10 Megabyte per minute of music while the same music can be 
25 stored in a MIDI file of less than 10 Kilobyte. This is possible, like already mentioned 
above, since the MIDI file contains only the instructions needed by a polyphonic, so 
called MIDI synthesiser to reconstruct the respective sound and do not need the sound 
data itself 

30 Loudspeakers used in audio applications are expected to reproduce the sound as close as 
possible to the original sound. This is only possible in the flat response region above the 
resonance frequency of a loudspeaker where its impedance is nearly ohmic. The lower 
the resonance frequency of a loudspeaker is, the lower is the lowest frequency which 
can be reproduced closely to the original. Loudspeakers with a low resonant frequency 

35 are typically much larger than the limited space usually available in mobile terminals. 



A loudspeaker that fits in the limited space inside a mobile terminal typically has a 
resonant frequency between about 500 Hz and about 2 kHz. Audio frequencies below 
the resonant frequency are thereby considerably attenuated when being converted by the 
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loudspeaker into sound. As a result, the bass portion of a composition is cutoff spoiling 
the pleasure of listening to it considerably. 

A loudspeaker with a lower cut-off frequency of about or above 300 Hz is in the 
5 ongoing referred to as 'small size loudspeaker'. The lower cut-off frequency of a 
loudspeaker is defined as the frequency below which an electrical audio signal is 
converted with a lower efficiency than one with a frequency above it. It is usually close 
to the resonance frequency, but not identical with it. 

10 A considerable and very often a major part of the audio energy present in music pieces 
and polyphonic sounds is found in the lower frequency spectrum. A reproduction of 
respective audio data with a small size loudspeaker will muffle the sound components in > 
that lower frequency spectrum. The resulting music experience is thus inevitably 
unpleasant as most of the richness of the sound got lost. In some cases, a piece of music 

15 is even rendered unrecognisable. 

It is therefore an object of the present invention to improve the audibility of sound 
represented by audio data of a music file when reproduced by a small size loudspeaker. 

20 The above object is achieved by the invention as defined in the independent claims. 

The invention comprises a method for modifying a reproduction of a music file 
according to a transmission characteristic of a loudspeaker of a mobile terminal of a 
wireless communication system according to which audio data are identified in the 
25 music file which represent a sound with a spectral component below the transmission 
frequency of the loudspeaker, and according to which a reproduction of sound from the 
identified audio data is modified such, that the modified reproduction yields a sound 
spectrum having an increased energy content within the transmission frequency range of 
the loudspeaker as compared to sound obtained by an unmodified reproduction. 

30 

The invention is further represented by an apparatus for rendering sampled data from a 
music file according to a transmission characteristic of a loudspeaker of a mobile 
terminal of a wireless communication system, whereby the apparatus contains storage 
means for storing the music file and data related to the transmission characteristic of 
35 one or more loudspeaker, selection means for selecting data for a particular loudspeaker 
from the storage means, low frequency sound identification means for identifying audio 
data in the music file which represent a sound with a spectral component below the 
transmission frequency range of a loudspeaker according to the selected data, control 
means for controlling a modification of a reproduction of sound from the identified 



WO 2005/032208 PCT/EP2004/009966 

4 

audio data such, that the modified reproduction yields a sound spectrum having an 
increased energy content within the transmission frequency range of the loudspeaker as 
compared to sound obtained by an unmodified reproduction, and synthesising means for 
synthesising sampled data from the modified music score. 

5 

The above object is further achieved by a mobile terminal for use with a wireless 
communication system and adapted to reproduce audio data from a music file, with the 
mobile terminal comprising an apparatus according to the present invention for 
rendering sampled data from the music file, a transformation means for transforming 
10 the sampled data obtained from the apparatus into a respective analogue electrical 

signal, and a loudspeaker for converting the analogue electrical signal into a respective 
sound signal. 

Furthermore the above object is achieved by means of a software product comprising a 
15 series of state elements which are adapted to be processed by a data processing means 
of a mobile terminal such, that a method according to one of the claims 1 to 9 may be 
executed thereon. 

The present invention advantageously uses the ability of the human hearing to perceive 
20 the fundamental frequency of a complex sound signal even when the fundamental 

frequency is not part of the audio spectrum received by the ear. Accordingly, a listener 
is able to virtually hear the correct pitch of a sound as long as the overtones allow the 
hearing to 1 reconstruct ' the fundamental tone. This peculiarity of the human hearing is 
called virtual pitch and described e.g. in E. Zwicker and H. Fasti, Psychoacoustics, 
25 Facts and Models, Springer Verlag Berlin Heidelberg, New York, London, Paris, 
Tokyo, Hong Kong, Barcelona, 1990, p 110-116. By shifting the main energy of a 
sound spectrum reproduced from a music file into the transmission range of a 
loudspeaker, particularly a small size loudspeaker, the human hearing is supported in 
reconstructing the not audible parts of the sound spectrum. Further, the radiation 
30 efficiency of a loudspeaker is increased by emphasising the higher frequency part of a 
sound spectrum resulting in an improved audibility of the reproduced sound. 

Advantageous embodiments of the present invention are the subject of other claims. 

35 In a preferred embodiment of the present invention, the modification of the 

reproduction is based on the use of a modified parameter file like e.g. a modified wave 
table containing samples with a frequency spectrum adapted to the transmission 
frequency range of a small size loudspeaker used. When not using parameter files for 
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the music reproduction but an FM synthesis, the modified reproduction of sound is 
instead based on a likewise modified FM-spectra file. 

The present invention may further be implemented by basing the modification of a 
5 reproduction of sound on swapping a specification given in the music file for an 
instrument used to reproduce sound from the identified audio data by a substitute 
specification of an instrument with brighter timbre. The instrument of the substitute 
specification hereby advantageously belongs to the same category of instruments as the 
originally specified instrument to maintain the characteristics of the arrangement of the 
10 replayed piece of music. In a further development, if more than one substitute 

specification is available for being swapped with an original specification in the music 
file, the substitute specification is selected based on the register in which the originally 
specified instrument is to be replayed. By this the reproduction is optimised with respect 
to peculiarities of particular sequences of a score. 

15 

A straightforward modification of a sound reproduction is based on a transposition of a 
sound spectrum to a higher frequency range for retaining the characteristic of the music 
score but shifting the main energy content of the sound spectra into the transmission 
range of the reproducing loudspeaker. This is most effective for the shift interval being 
20 adapted to relocate the lower end of the sound spectrum within the transmission 
frequency range of the loudspeaker. 

For achieving a high sound pressure with the loudspeaker, the main energy content of 
the transposed sound spectrum is effectively located within a frequency range from 
25 5 kHz to 10 kHz. 

The format of the music score file preferably corresponds to a MIDI data file format, 
since this format is the most popular when small size music files are required. 

30 The methods explained for modifying the music score are best put into practice by the 
control means of the apparatus according to the invention. The control means may 
further store the modified music score in a music score file in the storage means of the 
apparatus for reuse with a particular loudspeaker. In a preferred embodiment of the 
present invention, the control means modifies the music score at the time sampled data 

35 are rendered from the music score file. 

An adaptation of a reproduction of a music file to the transmission characteristic of an 
electroacoustic transducer offers the opportunity to reproduce polyphonic compositions 
with a small size loudspeaker and a sound quality which is comparable to that of a high 
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fidelity transducer. This is particularly relevant for mobile terminals of wireless 
communication systems which are too small to integrate larger size loudspeakers 
offering a wide enough transmission range. The small size loudspeakers which are 
typically used in theses appliances are not capable of reproducing the complex sounds 
5 produced by a series of instruments so that it is sometimes hard to recognise a certain 
piece of music when replayed on the respective mobile terminal. 



In the following description, the present invention is explained in more detail with 
respect to special embodiments and in relation to the enclosed drawings, in which 

10 

Figure 1 shows the audio signal of a clarinet in a time and a frequency 
representation, 



Figure 2 shows a flow chart of a method according to the present invention, 

15 

Figure 3a shows the frequency spectrum of the audio signal according to Fig. 1 in 
relation to the transmission characteristic of a small size loudspeaker, 

Figure 3b shows the frequency spectrum of a sound signal resulting from converting 
20 the audio signal of Fig. 3a with a small size loudspeaker of indicated 

transmission characteristic, 



Figure 4 shows the frequency spectrum of the audio signal of Fig. 3a transposed 
according to an embodiment of the present invention, 

25 

Figure 5 shows an apparatus according to the present invention. 



An audio signal 1 obtained from a clarinet is shown in Figure 1. The upper 
representation illustrates the time representation la of the audio signal 1 for about two 
30 signal periods while the lower diagram shows the associated frequency distribution lb. 
It is understood from the representations that the audio signal 1 is composed of a 
fundamental oscillation 10 superimposed by several harmonics 11, 12, 13, 14 and so on 
forming a complex tone mixture, a so called sound. 

35 In accordance with the notation used in the relevant literature, a tone denotes a pure 
sine wave oscillation with a single spectral line. The colloquial expression 'tone' refers 
to a superposition of several sine wave oscillations and is correctly referred to as a 
'sound' like it is done in this specification. Sound is a strictly periodic event with a 
discontinuous frequency spectrum. It is composed of a fundamental tone 10 and several 
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overtones 11, 12, 13, 14 etc. having audio frequencies which are an integer multiple of 
the fundamental audio frequency. The ratio of the fundamental tone intensity to that of 
each overtones is characteristic for a certain instrument. It defines the timbre or tone 
colour of the respective instrument. The fundamental frequency of a sound is 
5 responsible for the perception of the pitch. 



The human ear works like a Fourier analyser performing a harmonic analysis on a 
sound received. This enables it to deduce the fundamental tone of a sound perceived 
based on the detected overtones. To some degree the ear is even able to perceive the 
10 original sound when the fundamental tone is completely missing in the received audio 
spectrum. The thus perceived correct pitch of the incompletely received sound is called 
virtual pitch. A detailed description related to the virtual pitch is given in E. Zwicker 
and H. Fasti, Psychoacoustics, Facts and Models, Springer Verlag Berlin Heidelberg, 
New York, London, Paris, Tokyo, Hong Kong, Barcelona, 1990, p 110-116. 

15 

Without the ability of the human ear to complete a missing spectral component of a 
sound by a virtual pitch it would not be possible to detect the intonation of a human 
i voice transmitted over a telephone line. The fundamental tones of a human voice, 
particularly a male one, are outside the transmission frequency range from 300 Hz to 
20 3.4 kHz used for telephone lines. Only the virtual pitch completion enables a human to 
perceive the intonation of the voice. 

One condition to be observed for perceiving a virtual pitch is a sufficient energy content 
of the audible spectral components of a sound. In other words, when too many of the 
25 overtones of a sound are muffled during a reproduction of the sound, the virtual pitch 
capability of the human ear may not be able to replace the missing part of the audio 
spectrum. 

When reproducing a music score from a music score file, like for instance a MIDI file, 
30 with a small size loudspeaker cutting off the low frequency tones, the original richness 
of the sound can be restored by the human ear if the intensity of the associated 
overtones is high enough. According to the present invention, the reproduction of a 
music file containing the score is modified prior to being reproduced according to a 
method, the basic steps of which are illustrated in the flow chart of Figure 2. 

35 

After selecting from e.g. a memory of a mobile terminal in step SO a certain music file 
for reproduction on that mobile terminal which has one or more electroacoustic 
transducers, data concerning the transmission characteristics of the transducer used for 
the replay are made available in step SI. If the electroacoustic transducer selected for 
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the replay is a loudspeaker according to the above given definition, the lower cut-off 
frequency is determined in the following step S2 from the data made available 
beforehand. 

5 Next, the music score is browsed in step S3 for audio data representing notes 
corresponding to a low frequency sound, that is a sound with a fundamental tone 
located below the transmission frequency range of the selected loudspeaker, and thus 
below its lower cut-off frequency. The reproduction of at least the audio data 
corresponding to the respective low frequency sounds is then modified in step S4 by 

10 defining a reproduction mode which results in an increased energy content of the sound 
spectrum within the transmission frequency range of the loudspeaker. This can be 
accomplished by either modifying the content of the audio data itself or by modifying 
the assignment to parameter or FM files used for synthesising the music file. After 
modification, the music score is passed on to the polyphonic synthesising process in 

15 stepS5. 

An effective way to modify the reproduction of a music file is to modify the assignment 
to a parameter file like e.g. a wave table or to a FM-file used for synthesising. These 
files can be generally referred to as synthesis base files, as both types form a base for 
20 interpreting the sound information in the music file by the polyphonic synthesiser. 

Assumed that a first synthesis base file intended to be used for reproducing a music file 
with a fidelity close to its original is used for replaying the music file with a small size 
loudspeaker, then the lower frequency components of the sound will be cut off 
rendering the replayed music unpleasant or even unbearable. 

25 

This effect is shown with respect to Figures 3a and 3b. Figure 3a shows the frequency 
spectrum lb comprising the fundamental tone 10 and the first four overtones 11, 12, 
13, and 14 of the clarinet sound according to Figure 1 . The representation is given in 
intensity I versus frequency f of the respective component. The frequencies of the 

30 overtones f,, f 2 > f 3 , and f 4 each are an integer multiple of the fundamental frequency f 0 . 
The dashed line 2 indicates the transmission characteristic of a small size loudspeaker 
used for a reproduction of the underlying sound. It is shown in a semi-logarithmic 
representation of its gain g versus transmission frequency f. Figure 3b shows the 
frequency spectrum of the sound as reproduced by a small size loudspeaker in sound 

35 pressure intensity I* versus transmission frequency f. Again, the transmission 

characteristic 2 of the small size loudspeaker is indicated with a dashed line. As can be 
seen from the representation, the intensity of the replayed fundamental tone 30 is 
suppressed with respect to the replayed overtones 31, 32, 33, 34, and 35. 
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To overcome the sound distortion, a second synthesis base file is provided on the 
mobile terminal which contains sound samples of brighter timbre. In these samples, the 
spectral energy of the sounds is in general shifted to the higher harmonics as compared 
to the samples of the original first synthesis base file. The second synthesis base file is 
5 used for synthesising the audio data of the music file each time the music file is 
replayed with the small size loudspeaker. 

Keeping different synthesis base files available advantageously allows to adapt the 
reproduction of a music file to a variety of loudspeakers with different transmission 
10 characteristics. Particularly when using an earphone, the first synthesis base file 

containing the samples for a high fidelity replay of the music file will be used, while 
the second synthesis base file with the sound samples of brighter timbre will e.g. be 
selected for a replay via the hands-free loudspeaker of a mobile terminal. 

15 In a further embodiment of the present invention, the specification given in a music 
score file for an instrument to be used for reproducing a certain sequence of low 
frequency sounding notes is swapped for a substitute specification defining a different 
instrument of brighter timbre for the reproduction. In other words, the registration of an 
instrument is swapped when the main audio energy is in the fundamental tones for low 

20 frequency sounds. As the instrument of the substitute specification has a brighter 

timbre, its sound energy represented by the overtone spectrum is generally higher than 
that of the originally registered instrument. Muffled sounding instruments which are 
registered or specified for a particular sequence are thus replaced by similar but brighter 
sounding, more crisp instruments. A certain type of piano could for instance be 

25 replaced by a brighter one or possibly even a harpsichord. 

The respective instrument replacement has not necessarily be carried out throughout the 
whole music score, but may advantageously be limited to certain voices of the score in 
which the instruments to be considered are played in a low register. The score is 
30 therefore scanned for voices with low notes played on muffled instruments which 
produce no considerable audio energy in the transmission frequency range of the 
loudspeaker. Voices in higher registers are then left with the instrument specification 
unchanged, whereas e.g. a flute in a low voice will be replaced by an oboe to improve 
the audibility of this voice when reproduced with a small size loudspeaker. 

35 

The instrument replacement or re- specification, respectively, is accomplished 
loudspeaker specifically. That means, that the type of instrument substituting the 
originally specified instrument depends on the transmission characteristic of the 
loudspeaker actually used. Hereto a cross reference table is preferably stored on the 
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reproducing device. The cross reference table contains a list of possible substitute 
instrument specifications for a variety of original instrument specifications, whereby 
each particular substitute specification is assigned to a loudspeaker with a particular 
transmission characteristic. As the choice of the best suited replacement instrument 
5 depends on the lower cut-off frequency f c of the electroacoustic transducer used for 
reproducing the score, several replacement or substitute instruments, respectively, are 
kept in store for an originally registered instrument. Priority is hereby given to 
replacement instruments belonging to the same category of instalments as the originally 
specified. If this will not render the expected results, specifications of instruments 
10 belonging to the same class will be preferred, e.g. a violin will be used to replace a 
viola. 

In a further favourable embodiment of the present invention the score is adapted as a 
whole to the transmission characteristic of a small size loudspeaker by transposing it 

15 upwards by a certain frequency interval. The frequency interval is substantially defined 
by the frequency spacing between the fundamental tone of a low frequency sound and 
the cut-off frequency of the small size loudspeaker. By transposing the music score 
upwards, all or at least nearly all frequencies in the sound spectrum are shifted upwards 
into the frequency range of the small size loudspeaker as shown in Figure 4. The 

20 fundamental tone 10 of the original sound spectrum shown in Figure 3a is shifted in the 
illustrated example above the lower cut-off frequency f c at the new frequency f',. The 
overtones are shifted accordingly by the same frequency interval r r f, to the new 
locations f 2 , P 3f and f 4 , respectively. As a result the main energy of the produced 
sound signal is located within the transmission frequency range of the small size 

25 loudspeaker. 

The transposition interval may be chosen dependent on the arrangement of the score. 
Music arranged relatively low will be transposed by a bigger interval than music which 
is already arranged fairly high with only a few registers showing low frequency sounds. 

30 

The present invention is not restricted to only enable a pleasant reproduction of 
polyphonic scores. Particularly when reproducing polyphonic scores which are used as 
ringer tones or alarm signals or the like, it is necessary to use the frequency range of 
the loudspeaker wherein it shows the highest degree of efficiency. Only then are the 
35 highest sound pressures possible achieved allowing the respective sound to be reliably 
perceived even in noisy environments or when the mobile terminal is kept in a pocket 
or bag. Small size loudspeaker are most efficient in the frequency range of 5 kHz to 
10 kHz. Therefore, the reproduction of polyphonic ringer or alarm tones will be 
modified according to one of the methods described above with the condition, that the 
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main energy content of the thus produced sound spectrum falls within the frequency 
range from 5 kHz to 10 kHz. 

The modification of a music score according to the above specified is implemented by 
5 an apparatus 100 shown in Figure 7. It comprises a storage means 101 for storing one 
or more music score files e.g. in the form of a MIDI file, data related to the 
transmission characteristic of one or more electroacoustic transducers like for instance a 
small size loudspeaker which is available on a reproducing appliance containing the 
apparatus 100. It may further store data related to the synthesising and related to the 
10 reproduction modification process, like for instance the above described cross-reference 
table or adjusted FM parameter and wave tables. 

A selection means 102 is further provided which is adapted to select data concerning the 
characteristic of a particular electroacoustic transducer chosen by a user for the 

15 reproduction of score. Since many mobile terminals provide more than one 

electroacoustic transducer for sound reproduction, e.g. high fidelity headphones in 
addition to an earpiece, a small size loudspeaker, or accessories like a desktop stand 
with a loudspeaker, a user is enabled to pick out one he thinks is best for the given 
situation. If a user chooses high fidelity headphones for listening to a music score, the 

20 selection means 102 will select the data containing the information about the 

reproduction frequency range of the respective headphones from the storage means 101. 
As the chosen headphones are capable of reproducing the full audio frequency 
spectrum, the music score will not be modified but replayed in its original version. 
When using a small size loudspeaker for replay, the information defining its 

25 transmission characteristic is selected from the storage means 101, and the music score 
reproduction is then modified accordingly. 

A music score may contain any type of sound data, like e.g. a polyphonic ringer signal, 
a signature tune, a piece of music or the like. Music scores serving an indication of a 

30 particular function on the appliance are usually pre-assigned to that function by either 
the user or the manufacturer of the appliance. Others, particularly those which are 
replayed for entertainment are usually selected manually by a user or downloaded e.g. 
via WAP (Wireless Application Protocol) for reproduction. When selected for 
reproduction by one of the described options, a low frequency sound identification 

35 means 103 searches the music score file for audio data or notes, respectively, which 
correspond to a low frequency sound having a fundamental tone of an audio frequency 
below the transmission frequency range of the beforehand selected loudspeaker. 
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The modification of sound reproduction according to one of the procedures described N 
above is controlled by the control means 104. After modification according to the data 
selected previously, the energy content of the sound spectrum is shifted for better 
overlapping with the transmission frequency range of the loudspeaker. The control 
5 means 104 may effectively be realised with a processing means so that the above 
specified procedures may be implemented in software for being carried out by the 
control means 104. The final synthesising of the modified and/or unmodified audio or 
sound data into sampled audio data is accomplished by the synthesising means 105. 

10 The software for implementing the control means 104 may further be kept in a physical 
autonomous form like on a data carrier or a data stream. 

The apparatus 100 is preferably part of a mobile terminal which transforms the sampled 
audio data produced by the synthesising means 105 into respective analogue electrical 
15 signal which are further converted into a corresponding sound signal by a small size 
loudspeaker of the mobile terminal. 

The control means 104 may also use the modification parameter for modifying the 
audio data present in the music file. The modified audio data can be restored to replace 

20 the original audio data in the original music file but preferably a new copy of the music 
file is created instead in the storage means 101 containing the modified audio data. This 
file is later reused for replaying the respective piece of music with the respective small 
size loudspeaker. This solution is particularly preferred for polyphonic signature tunes 
which are often used on a mobile terminal. As the original copy of the music file is still 

25 available on the storage means 101, it is possible to connect HiFi headphones with a 
wide reproduction frequency range to the mobile terminal and to reproduce the music 
file without any modification as this would only degrade the possible fidelity. In this 
case, the original music score is treated to be already optimised for reproduction with 
the sound spectra as close as possible to the sound of the real instruments. 

30 

For most of the music files the modification of the music file reproduction takes 
preferably place at the time the synthesising means renders sampled data from it. In a 
further embodiment of the present invention, the modification of the music file may be 
accomplished in an idle condition of the mobile terminal when only a few of its 
35 resources are used. Both methods enable an adaptation of a music file reproduction to 
the transmission characteristics of a loudspeaker without having to store the same piece 
of music in different versions on the mobile terminal. 



